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generating a motion vector field of video pixels of a 
current frame to be coded based on a reference frame; 

obtaining segmenting information defining the division of 
the current frame into image segments; 

coding the motion vector field of an image segment S^ to 
provide compressed motion information comprising a 
motion coefficient vector c k comprising a set of motion 
coefficients c,, such that c f -*4(x,y) represents the motion 
vector field, whereby f £ (x,y) defines a certain set of 
basis functions, wherein i is an index defining a cor- 
respondence between each basis function and a corre- 
sponding motion coefficient; 

predicting a motion vector field of a subsequent image 
segment with a set of predicted motion coefficients 
p,- based upon the previously generated motion coeffi- 
cient vector c^ 

generating for the subsequent segment S L a set of refine- 
ment motion coefficients i £ corresponding to the differ- 
ence between the set of motion coefficients c t and a set 
of predicted motion coefficients p, such that p £ +r £ =c £ . 

Another aspects of the invention comprise corresponding 
video codec, mobile station, telecommunications network 
and computer program product. 
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ADAPTIVE MOTION VECTOR FIELD 

CODING 

The present application is a continuation-in-part of patent 
application Ser. No. 09/371,641, filed on Aug. 11, 1999, the 5 
contents of which are incorporated by reference herein. 

FIELD OF THE INVENTION 



The present invention relates to video compression. In 
particular, the invention relates to coding of an estimated 
motion field and to generating motion information in a video 
sequence. 

BACKGROUND OF THE INVENTION 

Motion compensated prediction is a key element of the 
majority of video coding schemes. To describe the operation 
of motion compensated prediction it should be appreciated 
that each digital image contains certain set of pixels corre- 
sponding to certain parts of the image. Each pixel may be 
represented, for example, as intensities of Red, Green and 
Blue (RGB color system) or as intensities of the luminance 
and two chrominance components. 

FIG. 1 shows illustratively two segments of an image, 
and S £ , each showing a set of pixels 10 to 15 at old locations, 
that is in a previous image of a sequence of images. The new 
locations of these pixels in a current image are shown as 
positions 10' to 15'. The change of their location, that is their 
motion, defines respective motion vectors v\ to v 3 k and v\- 
V 3 , of the pixels in these segments. At the simplest, the 
segments are squares or rectangles. Attentively, and in 
legacy schemes, they may also be of an arbitrary form, as 
shown in FIG. 1. 

FIG. 2 is a schematic diagram of an encoder for com- 
pression of video sequences using motion compensation. 
Essential elements in the encoder are a motion compensated 
prediction block 1, a motion field estimation block 2 and a 
motion field coder 3. The operating principle of motion 
compensating video coders is to compress the prediction 
error E M (x, y), which is a difference between the incoming 
frame I„(x, y) being coded, called the current frame, and a 
prediction frame I„(x, y), wherein: 



E n {x, y)=I n (x, y)-l n (x, y) 



(1) 



The prediction frame I„(x, y) is constructed by the motion 
compensated prediction block 1 and is built using pixel 
values of the previous, or some other already coded frame 

denoted I re y(x, y), called a reference frame, and the motion 
vectors of pixels between the current frame and the refer- 
ence frame. Motion vectors are calculated by the motion 
field estimation block 2 and the resulting vector field is then 
coded in some way before being provided as an input to the 
prediction block 1. The prediction frame is then: 



y)=K e lx+Ax{x, y), y+Ay(x, y)~\ 



(2) 



Ax(x, y) and Ay(x, y) are the values of horizontal and 
vertical displacement of pixel in location (x, y) and the pair 
of numbers [Ax(x, y), Ay(x, y)] is called the motion vector 
of that pixel. The set of motion vectors of all pixels in the 
current frame I„(x, y) is called a motion vector field. The 
coded motion vector field is transmitted as motion informa- 
tion to the decoder together with encoded prediction error 
information. 

In the decoder, shown in FIG. 3, the current output frame 
I„(x, y) is reconstructed by finding the pixels' prediction 
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I„(x, y) in the reference frame I re /x, y) and adding a decoded 
prediction error E„(x, y). The motion compensated predic- 
tion block 21 generates the prediction frame using the 

received motion information and the reference frame l re /x, 
y). The prediction error decoder 22 generates the decoded 
prediction error E„(x, y) for adding to the prediction frame, 

the result being the current output frame I„(x, y). 

The general object of motion compensated (MC) predic- 
tion is to minimize amount of information which needs to be 
transmitted to the decoder together with the amount of 
prediction error measured, e.g., as the energy of E„(x, y). 

The document H. Nguen, E. Dubois, "Representation of 
motion information for image coding". Proc. Picture Coding 
Symposium '90, Cambridge, Mass., Mar. 26-18, 1990, 
pages 841—845, gives a review of motion field coding 
techniques. As a rule of thumb, reduction of prediction error 
requires a more sophisticated motion field model, that is, 
more bits must be used for its encoding. Therefore, the 
overall goal of video encoding is to encode the motion 
vector field as compactly as possible while keeping the 
measure of prediction error as low as possible. 

The motion field estimation block 2, shown in FIG. 2, 
calculates motion vectors of all the pixels of a given image 
segment minimizing some measure of prediction error in 
this segment, for example square prediction error. 

Due to the very large number of pixels in the frame, it is 
not efficient to transmit a separate motion vector for each 
pixel. Instead, in most video coding schemes, the current 
frame is divided into larger image segments so that all 
motion vectors of the segment can be described by few 
parameters. Image segments may be square blocks, for 
example, 16x16 and 8x8 pixel blocks are used in codecs in 
accordance with international standards ISO/IEC MPEG-1, 
MPEG-2, MPEG-4 or ITU-T H.261 and H.263, or they may 
comprise arbitrarily shaped regions obtained for instance by 
a segmentation algorithm. In practice, segments include at 
least few tens of pixels. 

In order to compactly represent the motion vectors of the 
pixels in a segment, it is desirable that the motion vectors are 
described by a function of few parameters. Such a function 
is called a motion vector field model. A known group of 
models is linear motion model, in which motion vectors are 
represented by linear combinations of motion field basis 
functions. In such models, the motion vectors of image 
segments are described by a general formula: 



N 



&x(x, y) = ^ Cifi(x, y) 



(3) 



;=i 



N+M 



Ay(x, y) = 2_j Cifi{x, y), 



i=N+l 



where parameters c i are called motion coefficients and are 
transmitted to the decoder. In general, the motion model for 
a segment is based on N+M motion coefficients. Functions 
f/x, y) are called motion field basis functions which and are 
known both to the encoder and decoder. Known motion field 
estimation techniques vary both in terms of the model used 
to represent the motion field and in the algorithm for 
minimization of a chosen measure of the prediction error. 

Both the amount and the complexity of the motion varies 
between frames and between segments. In one case some of 
the content of the image may be rotated, skewed and shifted 
from one side of the image to the opposite side of the image. 
On the other hand, in another case a video camera may turn 
slowly about its vertical axis so that all the pixels move 
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slightly in horizontal plane. Therefore, it is not efficient to 
always use all N+M motion coefficients per segment. 

One way to reduce motion information is simply to reduce 
the number of motion coefficients from the motion field 
model that models the motion of pixels' locations from one 5 
image to another. However, the prediction error tends to 
increases, as the motion field model becomes coarser. 

For every segment, it is necessary to determine the 
minimum number of motion coefficients that yields a satis- 
factorily low prediction error. The process of such adaptive 10 
selection of motion coefficients is called motion coefficient 
removal. This process is performed in the encoder by the 
motion field coding block 3, see FIG. 2. It is performed after 
motion field estimation performed by the motion field esti- 
mation block 2. 15 

In future, digital video transmission will be possible 
between wireless mobile terminals. Usually such terminals 
have limited space for additional components and operate by 
a battery so that they are likely not accommodate computing 
capacity comparable to fixed devices such as desktop com- 20 
puters. Therefore, it is crucial that the motion field coding 
performed in a video coder is computationally simple so that 
it does not impose an excessive burden on the processor of 
the device. Additionally, the encoded motion field model 
should be computationally simple to facilitate later decoding 25 
at a decoder in a receiving (mobile) terminal. 

Methods for performing motion estimation with different 
models and selecting the most suitable one are proposed in 
the documents H. Nicolas and C. Labit, "Region-based 
motion estimation using deterministic relaxation schemes 30 
for image sequence coding," Proc. 1994 International Con- 
ference on Acoustics, Speech and Signal Processing, pp. Ill 
265—268 and P. Cicconi and H. Nicolas, "Efficient region- 
based motion estimation and symmetry oriented segmenta- 
tion for image sequence coding," IEEE Tran. on Circuits and 35 
Systems for Video Technology, Vol. 4, No. 3, June 1994, pp. 
357—364. The methods attempt to adapt the motion model 
depending on the complexity of the motion by performing 
motion estimation with different models and selecting the 
most suitable one. The main disadvantage of these methods 40 
is their high computational complexity and the small number 
of different motion field models that can be tested in 
practice. 

Yet another method is described in W097/1 6025. A video 
codec includes a motion field coder for minimizing the 45 
number of motion coefficients of a motion vector field. In the 
coder, a first block includes means for forming a new matrix 
representation of the motion vector field. The new coded 
motion vector field is linear. A second main block includes 
means for merging pairs of adjacent segments if the com- 50 
bined segment area can be predicted using a common 
motion field. Merging Information is transmitted to a 
decoder. A third main block includes means for removing 
motion field basis functions. After each removing step, the 
squared prediction error is calculated and removing is con- 55 
tinued until the magnitude of the error is not acceptable. 
Final motion coefficients are calculated by solving a linear 
matrix equation. As a result, reduced number of motion 
coefficients for each segment is obtained. The motion coef- 
ficients are transmitted to the decoder. This approach allows 60 
removal of motion coefficients until a certain threshold of 
prediction error is reached. 

However, there is still a need to further reduce the 
complexity of the motion encoding process as well as the 
amount of motion information that needs to be sent to the 65 
decoder while causing minimal deterioration in the quality 
of a decoded image. 
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An objective of the present invention is to reduce the 
amount of motion field vector information produced by the 
motion field estimation block 2 by a large factor without 
deteriorating the decoded image to a significant extent. 
Another objective is to keep the complexity of the motion 
field coder low to allow practical implementation using 
available signal processors or general -purpose microproces- 
sors. 

SUMMARY OF THE INVENTION 

The invention is defined by the appended claims. 

By taking advantage of the predicted motion coefficients 
in a Motion Analyzer that forms part of the motion field 
coder of a video encoder, a better rate-distortion perfor- 
mance is achieved than with prior known solutions. 

Furthermore, a motion coefficient removal block can be 
used to compute a plurality of alternate combinations of 
motion coefficients to be used for further optimization of the 
rate -distortion performance. Preferably, the motion coeffi- 
cient removal block is adapted to implement certain cost 
function to find a combination with which the ultimate 
rate -distortion will be optimized. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a schematic diagram of an image divided in 
segments; 

FIG. 2 is a schematic diagram of an encoder according to 
prior art; 

FIG. 3 is a schematic diagram of a decoder according to 
prior art; 

FIG. 4 is a Motion Field Coder according to an embodi- 
ment of the invention; 

FIG. 5 is a Motion Analyzer according to an embodiment 
of the invention; 

FIG. 6 is a schematic diagram of a mobile station accord- 
ing to an embodiment of the invention; 

FIG. 7 is a schematic diagram of a mobile telecommuni- 
cations network according to an embodiment of the inven- 
tion. 

DETAILED DESCRIPTION 
FIGS. 1 to 3 are described above. 

In the following, an overview of the invention is provided 
to facilitate the further description of various embodiments 
of the invention. 

In accordance with a preferred embodiment of the 
invention, the motion field coder of a video encoder com- 
prises two main blocks. 

The first main block is called a Motion Analyzer 32, 
FIGS. 4 and 5. Its task is to find a new representation of the 
inputted motion field produced by the motion field estima- 
tion block 2, FIG. 2. This new representation is applied to 
the second main block, a motion coefficient removal block 
34, FIG. 4. Operations in the Motion Analyzer include a 
plurality of steps comprising matrix operations: in the first 
step the prediction frame's dependency on motion vectors is 
linearized using an approximation method. In the second 
step, a matrix A^ and a vector d^ are constructed for 
minimization of the square prediction error. In the third step, 
a diagonal system of linear equations based on factorization 
of A^ is built in order to create a flexible motion represen- 
tation in the form of a diagonal matrix R^ and an auxiliary 
vector z k . 

The Motion Coefficient Removal block 34 inputs the 
diagonal matrix R^ and the auxiliary vector z k produced by 
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to find the motion coefficients c=(c 1? c 2 , . . . , ^n+m) 
of a compressed motion vector field [Ax(-),Ay(-)] where the 
motion vectors are described by a linear motion model, the 
field being of the form: 



N 



&x(x, y) = ^CiMx, y) 



(4) 



i=i 



N+M 



Ay(x, y) = Cifi(x, y) 



i=N+l 



such that it minimizes a selected cost function, e.g. the 
Lagrangian cost: 



(5) 



Where the distortion D(S^) is the square error between the 
original and the coded segment. The rate R(S^) is equal to 
the number of bits needed to code the segment and param- 
eter X is a predefined coefficient denning the trade-off 
between the quality of the coded segment and the number of 
bits required for the compression of the segment. 



10 



15 



the Motion Analyzer block. Motion vectors of a segment are 
represented by a number of motion coefficients. For each of 
the segments, the motion coefficient removal block deter- 
mines if it is possible to simplify the motion field model 
without causing an excessive increase in reconstruction 
error. Typically, some basis functions are removed from the 
motion model, whereby fewer coefficients are required to 
describe such a simplified motion field model. 

The Motion Coefficient Removal block 34 modifies 
matrix equations involving the diagonal matrix R^ by 
removing one column of the diagonal matrix R^ and trian- 
gularizing the new system. As a result, there is one motion 
coefficient less in the equations and one term is removed 
from the vector z k . This operation corresponds to removal of 
one basis function from the motion field model. In order to 
determine a motion field model which optimizes a selected 
prediction error measure, or cost function, these operations 
are repeated until there are no basis functions remaining in 
the motion field model. Every time a basis function is 
removed, a new set of motion coefficients is evaluated by 20 
solving the matrix equations. This may be done by using any 
of the well known algorithms, for example, backsubstitu- 
tion. The final set of motion parameters, i.e. chosen to 
represent the motion of a particular segment, is the one 
minimizing the cost function. Preferably, the cost function is 25 
a weighted sum of a measure of prediction error and a 
measure of information required for decoding the image. 

For every processed segment, the Motion Coefficient 
Removal block 34 outputs selection information that defines 
the basis functions removed from the motion field model. 
Additionally, it outputs new motion coefficients correspond- 
ing to the remaining basis functions. Both the selection 
information and the new motion coefficients are transmitted 
to the decoder. 

FIG. 4 illustrates a motion field encoder 3 according to an 
embodiment of the invention. It corresponds to the block 3 

in FIG. 2. It inputs the reference frame l re j(x, y), the current 

frame I„(x, y), predicted motion coefficients, if any, and the 
motion vector field [Ax(x, y), Ay(x, y)] produced by the 40 
Motion Field Estimation block 2, FIG. 2. 

The output of the video encoder contains a compressed 
frame divided into segments defined by motion coefficients 
for a segment S^, which consists of P pixels with coordinates 
(x £ , y,), i— 1,2, ... P. The task of the Motion Field Coder is 45 
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To fulfill this task, the Motion Field Coder 3 comprises 
two main blocks, which are the Motion Analyzer block 32 
and the Motion Coefficient Removal block 34. The objective 
of the Motion Analyzer 32 is to find a new representation of 
the motion field. This new representation is used later, in the 
Motion Coefficient Removal block 34, to find motion coef- 
ficients for a given image segment in a fast and flexible 
manner. The Motion Coefficient Removal block 34 reduces 
the amount of motion information used to describe the 
motion field of a segment, which results in increase of the 
square prediction error, defined as 



(6) 



The operation of the Motion Analyzer 32 is next described 
in detail referring to FIG. 5. The operation of the Motion 
Analyzer 32 comprises the following steps: 

Step 1: Linearization of the error, block 42. In this step the 

reference frame I^yO m formula (6) is approximated using 
some known approximation method so that its dependency 
on [Ax(x, y), Ay(x, y)] becomes linear. Then the elements 
under the sum in formula (6) become a linear combination 
of motion coefficients c. 



^ 1 ( e j,l c l + e j,2 c 2 + ■■■ +ej,N+MCN+M -yjY 
7=1 



(7) 



Step 2: Construction of matrices, block 43. Minimization 
of formula (7) is fully equivalent to minimization of the 
matrix expression (E^c^— y^^E^— y^) or solving the fol- 
lowing equation: 



A/cC k -d k 



(8) 



where A k =H k T E k and d k =E k T y k . The vector y k is built in 
block 46. 



(9) 
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Step 3: Triangularization and creation of output, block 44. 
In this step equation (8) is triangularized using a known 
method to decompose A^ into a product of a lower diagonal 
matrix R k T and its transpose R^ 



(10) 



This may be carried out using Cholesky decomposition. 
An auxiliary vector z k ° is created solving 



Rk Tz k°=dk 



(11) 



for example, using backsubstitution. The vector of motion 
coefficients c k minimizing the formula (7) is now the 
unknown vector in the diagonal system 



Rk c k— Z k° 



(12) 



and can be solved when needed, for example, by using 
backsubstitution. 

The motion coefficients c k can be given as sums of 
predicted motion coefficients p^ and refinement motion 
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coefficients r^. The predicted motion coefficients are pre- 
dicted from previously generated motion coefficients and the 
refinement motion coefficients correspond to the difference 
between the predicted motion coefficients and motion coef- 
ficients calculated in the Motion Field estimation block 
(FIG. 2). In this case, equation (12) has the form: 



and, an output vector x k is created by calculating 

z k= z k- R kPk 



(13) 



(14) 



Block 47 forms the term z k ° and block 45 generates the term 
R^p^. The output of the following Motion Coefficient 
Removal block 34 becomes refinement motion coefficients 
instead of absolute motion coefficients. Otherwise output 
vector z k =z k °. 

Motion Coefficient Removal block 34 receives as input 
matrix R^ and vector z k produced by the Motion Analyzer 
block 32. Motion vectors of every segment are represented 
by N+M motion coefficients. 

For a given segment S^, the Motion Coefficient Removal 
block determines if it is possible to simplify the motion field 
model, without excessively increasing the selected error 
measure. A simplified motion field model is obtained when 
some basis functions are removed from the model in equa- 
tions (3) described in the background art of this application. 
Fewer coefficients are required to describe such a simplified 
motion field model. 

The following iterative procedure is performed in order to 
find the optimal motion vector field. 

Step A: Initial cost calculation. A Lagrangian cost for the 
segment is evaluated with the full motion model and stored 
together with the full set of motion coefficients. 

Step B: Finding the basis function with the smallest 
impact on prediction quality. Let R k n denote an nxn upper 
diagonal characteristic matrix with n basis functions remain- 
ing and R/ 1 ' 1 the same matrix with the i'th column removed, 
n sets of equations are generated each with the i'th column 
removed from the matrix R^" and the i'th element removed 
form the vector c k n : 



R k n > l c k n > l =z k n , 



n 



(15) 



All the equations generated are triangularized in a known 
manner by applying a series of multiplications of rows by 
scalars followed by additions of the rows, i.e., equation (15) 
is converted to the form: 
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Where (q,) 2 is an approximation of the increase in the 
square prediction error due to removing the i'th basis 
function from the motion model. The column yielding the 
smallest (q,) 2 when removed is the one to be removed during 
this iteration, effectively causing the i'th basis function to be 
removed from the motion model. 

Step C: Removing a basis function from the model. Anew 
matrix equation is built by selecting matrix R^' and vector 
z,"'' associated with the basis function to be removed and 
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removing the last row of the matrix and the last element of 
the vector as follows: 
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Step D: Coefficient calculation. A new set of motion 
coefficients for the reduced set of basis functions is calcu- 
lated by solving the triangular system: 



L c k " =z k 



(18) 



e.g. by backsubstitution. 

20 Step E: Cost calculation. A Lagrangian cost for the 
segment is evaluated and stored together with the set of 
motion parameters if this model is the best one so far. 

Step F: Final motion model selection. If there are still 
basis functions to be removed, steps B to E are repeated. If 
all the basis functions have been removed from the model, 
the output is generated. The output comprises selection 
information, describing which basis functions should be 
removed from motion field model, together with new motion 
coefficients corresponding to the remaining basis functions. 
Both selection information and motion coefficients are trans- 
mitted to the decoder. 

The main advantage of the present invention over prior art 
solutions is its ability to reduce the amount of motion 
35 information by a large factor without causing a large 
increase in reconstruction error. Additionally, the complex- 
ity of the overall system is low which allows practical 
implementation on available signal processors or general- 
purpose microprocessors. 

The Motion Coefficient Removal block is a very powerful 
tool for instantaneous adaptation of the motion model to the 
actual amount and type of motion in the video scene. This 
block can be used to test a large number of motion models, 
with or without motion parameter prediction. A strong 
advantage of this scheme is that it does not need to repeat the 
process of motion estimation when changing motion model 
and hence it is computationally simple. 

By using motion estimation followed by Motion Analyzer 
the motion field coder can find new motion coefficients for 
any desired model of the motion field by solving computa- 
tionally a very simple systems of linear equations. 

In the preferred embodiment, an orthonormalized affine 
motion vector field model with 6 coefficients is used. In 
55 practice, this model can handle with a high degree of 
accuracy even very complex motion in video sequences and 
yields good prediction results. 

The affine motion vector field is a motion model that can 
be used to approximate motion vectors with a set of motion 

60 coefficients. The affine motion model allows description of 
various types of motion, including translational, rotational, 
zooming and skewing movements. It comprises 6 basis 
functions, in which case, the motion vectors may be sub- 
stantially replaced by a sum involving six basis functions 

65 multiplied by motion coefficients, each motion coefficient 
computed for one particular basis function. The basis func- 
tions themselves are known to both the encoder and decoder. 
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In the Motion Analyzer block 32, linearization in step 1 is 
performed using Taylor expansion of the reference frame 

I^x, y) at every pixel (x i7 y,) where i=l,2, . . . , P (P being 
the number of pixels in the segment) around points: 

5 

y'-yt+Ayixt, y,) (19) 

Using the property that Za 2 =Z(-a) 2 , the prediction error is 
then 
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microprocessor monitors the state of the user interface UI 
and controls the digital camera block CAM. In response to 
a user command, the microprocessor instructs the camera 
block CAM to record a digital image into the RAM. Once 
the image is captured or alternatively during the capturing 
process, the microprocessor segments the image into image 
segments and computes motion field models for the seg- 
ments in order to generate a compressed image as explained 
in the foregoing description. A user may command the 
mobile station to display the image on its display or to send 



p 2 (20) 

2^ (Jref{x\, y\) + (Ax(x h yi ) - Ax(x h yi ))G x {x\, y\) + (Ay( Xi , yi ) - Ay(x h y i ))G y {x' i , y\) - /„(*,-, y,)) 



Auxiliary values g t (x, y) are calculated using the formula: 

fj(xi, y^Gxix'i, y\) when j = 1, 2, .. 



gj(x h y^ = 



fj( Xi , yi )G y (xU y\) when j = N + 1, N + 2, . . . , N + M 



the compressed image using the RF block to another mobile 
station, wired telephone, facsimile device or another tele- 
(21) 20 communications device. In a preferred embodiment, such 
transmission of image is started as soon as the first segment 
is encoded so that the recipient can start decoding process 
with a minimum delay. In an alternative embodiment, the 
where functions f^x^ y t .) are basis function as defined in mobile station comprises an encoder block ENC dedicated 
equation (4). Matrix E^ and vector y^ in equation (9) are built 25 for encoding and possibly also for decoding of digital video 
using formulae: data. 



E k = 



gi(x 2 ,y 2 ) g2(x 2 , y 2 ) 



gN+M&i, yi) 
gN+M(x 2 , y 2 ) 



gi(x P , y P ) g 2 (x P , y P ) ... g N+M (x P , y P ) _ 

I n (xi, yi) - 7 re f(x[, y[) 4- G x (x[, y[)Ax(x l , yO + G y (x[, y[)Ay(x lt yi) 
In(x 2 , y 2 ) - ~Iref{x' 2 , y 2 ) + G x (x' 2 , y' 2 )Ax(x 2 , y 2 ) + G y (x 2 , y' 2 )Ay(x 2 , y 2 ) 

I n (x P , y P ) - 7 ref (x P , y' P ) + G x (x P , y P )Ax(x P , y P ) + G y (x P , y P )A y (xp, y P ) 



(22) 



(23) 



G-^x, y) and G y (x, y) are values of the horizontal and 

vertical gradient of the reference frame I re ^(x, y) calculated 
using the derivative of the well known cubic spline inter- 
polation function. 

Matrix A^ is factorized using Cholesky decomposition 
and the system in formula (15) is triangularized using a 
sequence of Givens rotations. 

Motion coefficients for new motion models are calculated 
by solving equation (18) using a backsubstitution algorithm. 

The pixel values of l re /x, y), G^(x, y) and G y (x, y) are 
defined only for integer coordinates of x and y. When x or 
y are non-integers the pixel value is calculated using cubic 
spline interpolation using integer pixel values in the vicinity 
of x and y. 

FIG. 6 shows a mobile station MS according to an 
embodiment of the invention. A central processing unit, 
microprocessor juP controls the blocks responsible for dif- 
ferent functions of the mobile station: a random access 
memory RAM, a radio frequency block RF, a read only 
memory ROM, a user interface UI having a display DPL and 
a keyboard KBD, and a digital camera block CAM. The 
microprocessor's operating instructions, that is program 
code and the mobile station's basic functions have been 
stored in the mobile station in advance, for example during 
the manufacturing process, in the ROM. In accordance with 
its program, the microprocessor uses the RF block for 
transmitting and receiving messages on a radio path. The 



40 

FIG. 7 is a schematic diagram of a mobile telecommuni- 
cations network according to an embodiment of the inven- 
tion. Mobile stations MS are in communication with base 
stations BTS by means of a radio link. The base stations BTS 

45 are further connected, through a so-called Abis interface, to 
a base station controller BSC, which controls and manages 
several base stations. The entity formed by a number of base 
stations BTS (typically, by a few dozen base stations) and a 
single base station controller BSC, controlling the base 

50 stations, is called a base station system BSS. Particularly, the 
base station controller BSC manages radio communication 
channels and handovers. On the other hand, the base station 
controller BSC is connected, through a so-called A interface, 
to a mobile services switching centre MSC, which 

55 co-ordinates the formation of connections both from and to 
mobile stations. A further connection is made, through the 
mobile service switching centre MSC, to outside the mobile 
communications network. Outside the mobile communica- 
tions network there may further reside other network(s) 

60 connected to the mobile communications network by 
gateway(s) GTW, for example the Internet. In such external 
network, or in the telecommunications network, there may 
be located another video decoding or encoding stations, such 
as computers PC. In an embodiment of the invention, the 

65 mobile telecommunications network comprises a video 
server VSRVR to provide video data to a MS subscribing to 
such a service. This video data is compressed by using the 
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motion compensated video compression method as 
described earlier in this document. The video server may 
function as a gateway to an online video source or it may 
comprise previously recorded video clips. 

The system can be implemented in a variety of ways 5 
without departing from the spirit and scope of the invention. 
For instance: 

Different motion models can be used. 

Different cost functions can be used instead of the 

in 

Lagrangian cost given in equation (5). 

Different methods can be used to linearize the term in 
formula (6). 

Different factorization algorithms can be used in the 
Motion Analyzer (e.g. QR decomposition instead of 15 
Cholesky decomposition). 

The strategy for deciding the removal order of basis 
functions may vary (e.g. a predefined order can be used 
instead of evaluating q/s in equation (16)). 

Some motion models can be skipped in the Motion 20 
Coefficient Removal block (e.g. if some input coeffi- 
cients have negligible amplitude, algorithm may decide 
to discard corresponding basis functions). 

Cost calculation in the Motion Coefficient Removal block 
does not have to take place immediately after removal 
of a basis function but can be delayed. This means that 
in the Coefficient Removal block Steps B— D may be 
iterated instead of steps B— E. Step E (cost calculation) 
can then be performed in another loop for all the 
generated motion models (for this segment). 

Triangularization of the system in equation (15) can be 
performed using various algorithms. 

Calculation of final coefficients by solving equation (18) 
can be performed using a number of known algorithms 35 
for solving systems of linear equations. 

Prediction terms in equations (13) and (14) can be incor- 
porated alternatively by substituting motion vectors 
(Ax(x l3 y,) and Ay(x / , yj) in equation (23) by motion 
vector refinements (differences between estimated 40 
motion vectors and predicted motion vectors). 

Different interpolation methods can be used to obtain 

values of I^-^x, y), G^x, y) and G y (x, y) in non-integer 
coordinates. 

What is claimed is: 45 
1. A video codec for motion compensated encoding of 
video data comprising: 

means for generating a motion vector field of video pixels 

of a current frame to be coded based on a reference 

frame; 

means for obtaining segmenting information defining the 
division of the current frame into image segments; 

a motion field coder for coding the motion vector field of 
an image segment S k to provide compressed motion 55 
information comprising a motion coefficient vector c k 
comprising a set of motion coefficients c,-, such that 
c £ -£ £ (x,y) represents the motion vector field, whereby 
f,(x,y) defines a certain set of basis functions, wherein 
i is an index defining a correspondence between each 60 
basis function and a corresponding motion coefficient; 

prediction means for predicting a motion vector field of a 
subsequent image segment S^ with a set of predicted 
motion coefficients p, based upon the previously gen- 
erated motion coefficient vector c k ; 65 

means for generating for the subsequent segment S L a set 
of refinement motion coefficients r i corresponding to 
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the difference between the set of motion coefficients c £ 
and a set of predicted motion coefficients p £ such that 

2. A video codes according to claim 1, having a Motion 
Analyzer comprising: 

means for calculating an approximation matrix E^ and an 
approximation vector y L for the subsequent image 
segment S L ; 

means for calculating a predefined distortion measure of 
the subsequent segment as a function of E^c^— y L , c L 
being a vector of motion coefficients c, for approxi- 
mating said motion vector field of the subsequent 
image segment S^ together with the set of basis func- 
tions; 

means for generating output parameters including a 
diagonal output matrix R x and an output vector X L . 

3. A video codec according to claim 2, the motion vector 
field being represented by an affine model. 

4. A video codec according to claim 2, further comprising: 
a coefficient removal block for reducing the number of 

motion coefficients for reducing at least one motion 
coefficient in order to generate at least two alternative 
sets of motion coefficients; 
means for calculating a cost of for a set of motion 
coefficients; and 

means for selecting the set of motion coefficients that 
results in the minimum cost among the alternative sets 
of motion coefficients. 

5. A video codec according to claim 4, the cost being a 
function of a video coding error and amount of data required 
to code an image segment with the set of motion coefficients. 

6. A video codec according to claim 2, wherein matrix R x 
is arranged to be obtained by decomposing matrix 
A i =E i r E i into a lower diagonal matrix R x r and its trans- 
pose R^ so that A i =R i r R i . 

7. A video codec according to claim 2, wherein vector x L 
is arranged to be obtained by calculating d i =E i r y i and 
solving the set of equations ~R L T z L —d L . 

8. A video codec according to claim 2, wherein vector z x 
is arranged to be obtained by calculating d i =E i r y i , solving 
the set of equations R I T z L °=d L and calculating z L =z L ° —R L -p L 
when motion parameter prediction vector p^ is present. 

9. A storage device comprising a video codec according to 
claim 1. 

10. A mobile station comprising a video codec according 
to claim 1. 

11. A mobile telecommunications network comprising a 
video codec according to claim 1. 

12. A computer program product comprising a video 
codec according to claim 1. 

13. A method for motion compensated encoding of video 
data comprising the steps of: 

generating a motion vector field of video pixels of a 
current frame to be coded based on a reference frame; 

obtaining segmenting information defining the division of 
the current frame into image segments; 

coding the motion vector field of an image segment S k to 
provide compressed motion information comprising a 
motion coefficient vector c k comprising a set of motion 
coefficients c,, such that c^-f^Xjy) represents the motion 
vector field, whereby f £ (x,y) defines a certain set of 
basis functions, wherein i is an index defining a cor- 
respondence between each basis function and a corre- 
sponding motion coefficient; 

predicting a motion vector field of a subsequent image 
segment S^ with a set of predicted motion coefficients 
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based upon the previously generated motion coeffi- 
cient vector c k ; 

generating for the subsequent segment S L a set of refine- 
ment motion coefficients r, corresponding to the differ- 
ence between the set of motion coefficients c t and a set 
of predicted motion coefficients p t such that p^+r—c^. 

14. A method according to claim 13, further comprising 
the steps of: 

calculating an approximation matrix E x and an approxi- 
mation vector y L for the subsequent image segment S^; 

calculating a predefined distortion measure of the subse- 
quent segment as a function of E^c^— y L , c L being a 
vector of motion coefficients c, for approximating said 
motion vector field of the subsequent image segment 
together with the set of basis functions; 

generating output parameters including a diagonal output 
matrix R x and an output vector x L . 

15. A method according to claim 14, wherein the motion 
vector field is represented by an affine model. 

16. A method according to claim 14, further comprising 
the steps of: 

reducing the number of motion coefficients for reducing at 
least one motion coefficient in order to generate at least 
two alternative sets of motion coefficients; 
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calculating a cost of for a set of motion coefficients; and 

selecting the set of motion coefficients that results in the 
minimum cost among the alternative sets of motion 
coefficients. 

17. A method according to claim 16, the cost being a 
function of a video coding error and amount of data required 
to code an image segment with the set of motion coefficients. 

18. A method according to claim 14, further comprising 
the step of obtaining matrix R^ by decomposing matrix 
A i =E i r E i into a lower diagonal matrix and its trans- 
pose R x so that A i =R i r R i . 

19. A method according to claim 14, further comprising 
the step of obtaining vector x L by calculating d i =E i r y i and 
solving the set of equations R i r z i =d i . 

20. A method according to claim 14, further comprising 
the steps of obtaining vector z x by 

calculating d i =E i r y i , solving the set of equations 
R^VMl; and 

calculating z i =z i °-R i p i when motion parameter predic- 
tion vector p^ is present. 



